When GL meets the corpus: a data-driven investigation of lexical types and coercion phenomena

نویسندگان

  • Elisabetta Jezek
  • Alessandro Lenci
چکیده

In this paper we present an analysis of corpus-derived V-arg combinations aiming to provide a datadriven characterization of Lexical Types (LTs) and represent how types behave compositionally, i.e. how they enter compositional processes and are modulated by them. We will do so using the enriched compositional rules and the type system as presented in Pustejovsky (2006). Our main concerns are twofold: i.) first of all, we want to show with a specific case-study (§. 5 onwards) how a data-driven investigation can shed light on the structure and the combinatorics of LTs; ii.) starting from the results of this investigation, we intend to propose a general methodology for lexical modeling in which the Generative Lexicon (GL) theory and corpus analysis are deeply interwoven in a process of mutual feeding. In fact, we argue that, if on the one hand corpus data can help to anchor the study of lexical dynamics and type system on empirical evidence, on the other hand GL can provide the crucial interpretative key for corpus data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Lexical Bundles in English Abstracts of Research Articles Written by Iranian Scholars: Examples from Humanities

This paper investigates a special type of recurrent expressions, lexical bundles, defined as a sequence of three or more words that co-occur frequently in a particular register (Biber et al., 1999). Considering the importance of this group of multi-word sequences in academic prose, this study explores the forms and syntactic structures of three- and four-word bundles in English abstracts writte...

متن کامل

Developing a Corpus-Based Word List in Pharmacy Research ‎Articles: A Focus on Academic Culture

The present corpus-based lexical study reports the development of a Pharmacy Academic Word List (PAWL); a list of the most frequent words from a corpus of 3,458,445 tokens made up of 800 most recent pharmacy texts including research articles, review articles, and short communications in four sub-disciplines of pharmacy. WordSmith (Scott, 2017) and AntWordProfiler (Anthony, 2014) were used to sc...

متن کامل

Concordance-Based Data-Driven Learning Activities and Learning English Phrasal Verbs in EFL Classrooms

In spite of the highly beneficial applications of corpus linguistics in language pedagogy, it has not found its way into mainstream EFL. The major reasons seem to be the teachers’ lack of training and the unavailability of resources, especially computers in language classes. Phrasal verbs have been shown to be a problematic area of learning English as a foreign language due to their semantic op...

متن کامل

Incorporating Polarity in Lexical Resources

The felicity of contrast and parallel relations as well as structures such as ellipsis are in part dependent on the presence of lexical elements contrasting or sharing. Traditional dictionaries and lexical resources like WordNet code information about antonyms, yet this is often not the type of polarity found in real contrast or parallel examples. It is not yet clear how negative or positive po...

متن کامل

Numerical Investigation of Double- Diffusive Mixed Convective Flow in a Lid-Driven Enclosure Filled with Al2O3-Water Nanofluid

Double-diffusive mixed convection in a lid-driven square enclosure filled with Al2O3-water is numerically investigated. Two-dimensional nonlinear governing equations are discretized using the control volume method and hybrid scheme. The equations are solved using SIMPLER algorithm. The results are displayed in the form of streamlines, isotherms, and iso-concentrations when the Richardson number...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007